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Abstract 

Let STS{n) denote the number of Steiner triple systems on n ver- 
tices. Our main result is the following upper bound. 



STS{n)< ({1 + 0(1))^ 



2 



The proof is based on the entropy method. 

As a prelude to this proof we consider the number F{n) of 1- 
factorizations of the complete graph on n vertices. Using the Kahn- 
Lovasz theorem it can be shown that 

/ n \ — 

F(n)< ((1 + 0(1))^) ' . 

We show how to derive this bound using the entropy method. Both 
bounds are conjectured to be sharp. 



1 Introduction 

A Steiner triple system on a vertex set is a collection of triples T C 
such that each pair of vertices is contained in exactly one triple from T. It 
is well known that a Steiner triple system (STS) on n > 1 vertices exists if 
and only if n = 1 or 3 (mod 6). 
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A 1-factorization of the complete graph on n vertices Kn is a partition 
of the edges of Kn into n — 1 perfect matchings, or in other words, a proper 
edge coloring of Kn using n — 1 colors. It is well known that a 1-factorization 
of Kn exists if and only if n is even. 

It has been observed (e.g., [3]) that 1-factorizations and Steiner triple 
systems are special types of Latin squares. We view a Latin square as an 
nxnxn array A with — 1 entries in which each line has exactly one element 
that equals 1. To see that this description of Latin squares is equivalent to 
the usual definition, we associate to the array A a matrix L, that is defined 
via L{i,j) = k where k is the unique index for which A{i,j,k) = 1. A 1- 
factorization is a Latin square A such that A{i,j, k) = 1 A{j, i,k) = 1 and 
A{i,i,n) = 1 for all i. Thus, L is a symmetric matrix in which all diagonal 
terms equal n. A Steiner triple system is a Latin square A where A[i, j, k) = 1 
implies that A{a{i),a{j),a{k)) = 1 for every permutation a e 5*3 on i,j,k, 
and A{i, i,i) = 1 for all i. This can also be expressed in terms of L, though 
it's a bit more complicated to formulate. 

These relations suggest that there might be deeper analogies to reveal 
among Latin squares, STS's and 1-factorizations. Indeed, we have recently 
proved an asymptotic upper bound on the number of Latin hypercubes [9], 
and here we prove analogous statements for STS{n) and F{n). 

The best previously known estimates for the number of n-point Steiner 
triple systems are due to Richard Wilson [12]. 

— 

Wilson also conjectured that, in fact, STS{n) = ((l + o(l))^) . We 
show that this is an upper bound on the number of Steiner triple systems. 

Theorem 1.1. ^ 

STS{n)<[{l + o{l))^y^ . 

The Kahn-Lovasz theorem considers a (not necessarily bipartite) graph 

with degree sequence ri, r„. It shows that the number of perfect matchings 

1 

in such a graph is at most YYi=i i^i^-) ^"^^ ■ particular a regular graph has 
at most ((i!)53 perfect matchings. For a proof see Alon and Friedland [1]. 
These results are inspired by Bregman's proof [2] of Mine's conjecture on 
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the permanent. For a very recent proof of this result that uses the entropy 
method, see [6]. 

This theorem easily yields an upper bound on F{n) as follows: Choose 
first a perfect matching of Kn- The remaining edges constitute an n — 2 
regular graph in which we again choose a perfect matching. We proceed 
to choose perfect matchings until we exhaust all of E{Kn). The theorem 
implies that we have at most ((n — A;)!) choices for the k-th step, so 
that F{n) < YYdZl [dl)^- An application of Stirling's formula gives: 

Theorem 1.2. 



It is an interesting question to seek lower bounds to complement these 
upper bounds. We have already mentioned Wilson's lower bound on STS{n). 
Cameron gave a lower bound for F{n) in [4]. When done with care this 
argument yields 



For the sake of completeness we repeat his argument which starts with the 
inequality F{n) > L(|)(F(?i/2))^, where L{n) is the number of order-n Latin 
squares. This inequality is shown as follows: Partition the vertex set [n] into 
two equal parts, and select an arbitrary 1-factor on each. It is well-known 
and easy to prove that a 1-factorization of Kr^r is equivalent to an order-r 
Latin square. It follows easily from the van-der-Waerden conjecture that 
L{n) > (-^^-^^^T^)"^ (see [11]). The derivation of Cameron's lower bound is 
a simple matter now. We note that this argument works when n is divisible 
by 4. When n = 4r + 2 some additional care is required. 

Conjecture 1.3. 



,2 
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STS{n) = ((1 + 0(1)) 



) 
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Our proofs are based on the entropy method, a useful tool for a variety 
of counting problems. The basic idea is this: In order to estimate the size 
of a finite set J-", we introduce a random variable X that is uniformly dis- 
tributed on the elements of J-". Since H{X) = logdJ-"!), bounds on H{X) 
readily translate into bounds on \J^\. The bounds we derive on H{X) are 
based on several elementary properties of the entropy function. Namely, if 
a random variable takes values in a finite set S then its entropy does not 
exceed log \ S\ with equality iff the distribution is uniform over 5". Also, if X 
can be expressed as X = (Yi, . . . , Y^), then H{X) = H{Yj\Yi, . . . , Yj^i). 
The expression X = {Yi, . . . , Yk) can be viewed as a way of gradually reveal- 
ing the value of the random variable X. It is a key ingredient of our proofs 
to randomly select the order -< in which the variables Y^ are revealed and 
average over the resulting identities H[X) = H{Yj\Yi s.t. i -< j). Similar 
ideas can be found in the literature, but to the best of our knowledge this 
method of proof is mostly due to Radhakrishnan [10]. We deviate somewhat 
from the standard notation in that our logarithms are always natural, rather 
than binary. Formally, we should use the notation He for the entropy func- 
tion, but to simplify matters, we stick to the standard notation H{X). We 
refer the reader to [5] for a thorough discussion of entropy. For an example 
of the entropy method, see [10]. 

In section 2, we give an entropy proof of theorem 1.2. Using similar 
methods, in section 3 we give an entropy proof of theorem 1.1. 

2 An upper bound on 1-factorizations 

Let n be an even integer, and let X be a random, uniformly chosen 1- 
factorization of Kn- Define the random variable Xij = Xj^i to be the color of 
the edge {i,j} in X. In order to analyze these random variables we first se- 
lect a random ordering, denoted of the vertices. Using the relation <^ we 
introduce next a random ordering -< of the edges as follows: For each vertex 
V we choose a random ordering of E^, the set of edges {v, u} where v <^u. To 
define the ordering -<, we scan the vertices in the order <^. For each vertex 
V, we scan the edges {m, v} G in their chosen order. Our proof proceeds 
by successively revealing the colors of the edges, i.e. the values taken by the 
variables Xi j, where the edges are exposed in the order -<. 

Given two vertices i 7^ j, we are interested in the (random) number of 
colors which are available for the edge given the values taken by the 
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-<-preceding edges. We are unable to determine this number exactly. Rather 
we define a random variable Ni^j that is an upper bound on this number. If 
j <^ z, then the variable Xij is determined by the preceding variable 
so in this case it is natural to define Ni^ = 1. We proceed to the more 
interesting case where i <^ j. Here are two reasons why some color may be 
unavailable for Xij. For every vertex t ^ z we already know the colors of 
the edges {t,i} and {t,j}, neither of which can be used for the edge {i,j}- 
The set of colors that are ruled out for this reason is denoted Aij. It is also 
possible that i <^ k and {i,k} -< so that cannot take the color 

Xi^k- The set of such colors is denoted Bij. Formally: 

• Aij := {Xt,i\t < z} U {Xtjt < z}. 

• Bij := {Xi^k\i < k and {i, k} -< {i,j}}. 

The set of colors that are not ruled out for the first reason is denoted: 

Mi J := [n - 1] \ A, J 
and those remaining after further forbidding colors due to the second reason: 

:= Mi J \ Bij. 

As mentioned, we seek to define a random variable Nij that is an upper 
bound on the number of possible values for Xj j given the -<-previous edge 
colors. To this end we define Nij as the cardinality of N'ij- As it turns out, a 
cruder upper bound on the number of possible values for Xij is useful as well. 
Namely, one that takes into account only the colors of the edges involving 
vertices that <S-precede i. This is accomplished by the random variable Mij 
which is defined as |A^j,j|. 

Fix an ordering -<. We apply the chain rule for the entropy function and 
conclude that 

log(F(n)) = H{X) = Y,H{X,JX, : e -< < 5^Ex[log(A^,,,)]. 

Next we take the expectation with respect to the random choice of the 
order -<. 

log(F(n)) < E^[J]Ex[log(Ar,,,)]] = ^ Ex[E^[log(iV,,,)]]. 
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Fix a 1-factorization X and a pair i ^ j. If j ^ i, then log(A'^j j) = 0. The 
probabihty that i <^ j is |, so that 

log(FH) < i5^Ex[E^|K<,[log(iV^,,)]]- 

A natural approach is to bound the expectation E_.|j^j[log(Mj .,■)] using 
Jensen's inequahty. As it turns out, this yields a somewhat weaker upper 
bound. Rather we argue as follows: 

E^|,«,[log(M,,,)] =E«|,«,-[log(M,,,)] = 

i<<j[log(M,j)]] < Ep|i«j-[log(E«|p, i<<j[Mij])] (1) 

For the first equality note that Aiij depends only on the ordering <^. 
Next we condition on p, the position of i in ^ and then, finally we resort 
to Jensen's inequality. In order to bound this expression it is necessary to 
understand the distribution of p and the expectation of Mjj given p. 

Lemma 2.1. The probability that i occupies the p-th position in given 
that i j is 

n — p 
2— 

n[n — 1) 

Proof. We are sampling uniformly from among the y permutations in which 
i <^ j. To specify such a permutation in which i is in the p-th position, we 
must assign j to one of the n—p positions following i. There are (n — 2)! ways 
to order remaining elements with a total of {n—p){n — 2)\ such permutations. 
The conclusion follows. □ 

Lemma 2.2. E«|,, ,«,[M,„,] = 1 + in-p-i)in-P-2) 

Proof. Now we are sampling uniformly from among the {n — p)(n — 2)\ per- 
mutations in which i is in the p-th position and i j. If Xij = s, then 
clearly the color s belongs to Mij- This corresponds to the 1 term in the 
lemma. For any other color t ^ s, let a (resp. b) be the unique vertex such 
that Xi^a = t (resp. Xj^b = t). Clearly, t G J^ij iff i ^ a, b. But 

T-, /■ 7I-- • •• ■ -N (n — p — l)(n — p — 2) 

rrfz ^ a, bh is m position p,i j) = -, z-, r . 

(n — l)(n — 2) 

There are n — 2 colors t ^ s and the conclusion follows. □ 
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Using lemmas 2.1 and 2.2, we have 

Ep|i«j[log(E«|p, i^j[Mij] 



n-l 



o n-p j^g^^ ^ (n-p- l)(n-p-2) . 
^ n(n-l) 



2 frlogd ■ 



i{n — 1) n — 1 



= - ^ r log(l + + 0(1) = - ^ r log(-) + o(l). 

The function rlog(^) is unimodal, and its minimum is achieved at r = 
Therefore 

""1 ^u^\ fn 

> rlog(— )</ Mlog — c/m + 2^. 
^ n Jo \n J e 



Thus, 



Ep|i«i[log(E«|p,i«j[Mij])] < — / Mlog — + o(l) 



logn-l + o(l). (2) 

We next proceed to consider colors that are ruled out due to variables 
that correspond to edges in Ei. An edge {z, k} may rule out additional colors 
if Xj G M.ij- There are Mj ,,■ — 1 such edges, one for each color in M.i j\Xi j. 
Consequently, we are only interested in counting such edges that -<-precede 

E^|,«,[log(iV,,,)] =E^|M,,,«i[log(iV.,,)] = 
5^Pr(M„- = /)E^|M,,=^.«,[log(iV„)] = 

5^Pr(M,, = = 5^Pr(M„- = /)(log/ - 1 + o(l)) = 

I I 

E^|,«,[log(M,,,-)] - 1 + 0(1) < logn - 2 + 0(1). 
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We used the fact that given Mj = m, the number A^j j of possible values 
for the random variable Xi j is uniformly distributed between 1 and m. In 
the final step we used Equations 1 and 2. 

Consequently, 

log(F(n)) < ^ 5^(logn - 2 + o(l)) = (logn - 2 + o(l)) 
which yields the bound 

F(n)< ((1 + 0(1))^) 

3 An upper bound on the number of Steiner 
triple systems 

The ideas here are similar, but the details are different. 

Let X be a uniformly chosen random Steiner triple system on n vertices. 
Define = Xj^i to be the unique vertex k such that {i, j, /c} is a triple in 
X. As before, we define next a random ordering <^ on the vertices and a 
random ordering -< of the edges. 

Fix a Steiner triple system X, orderings <^ and -< and a pair of vertices 
i ^ j- Let Xi^j = k. We want to define a random variable Nij that is an 
upper bound on the number of vertices that are available for Xij, given the 
values of the preceding variables. Let Fij denote the event that i <^ j,k 
and -< {i,k}. Clearly, Pr(Fij) = |. If Fij doesn't occur, then Xij is 

uniquely determined by the preceding variables, so in this case we define Nij 
to be 1. 

Let t 7^ Xij be a vertex. We consider two classes of reasons for which t 
may be ruled out as the value of Xij given the previously revealed choices. 
The first is the union of the following three events: t <^ Xi^t i and 
Xj^t ^ i- Namely, the variables corresponding to vertices that <^-precede 
i reveal a triple that includes t and either i or j, so that t} cannot be 
a triple in X. The second possibility is the union of the events {i,Xj^t} -< 
and {i,t} -< where the revealed triple {i,t,Xi^t\ rules out the 

possibility that {z, j, t} is in X. 

We define the set of vertices which are ruled out for Xj^- due to the first 
reason: 
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Among the remaining vertices we consider those that are unavailable due 
to the second reason 



Bij := {t ^ Ai^j\{i,t} ^ {i,j} or ^ {z,j}}. 

Further, 

Mi,j := {V \ \ Aij. 

Mi J := Mi J \ Bij. 

As before we define Nij as the cardinality of A/ij. Also, let Mjj := |A^ij|. 

The random variable Mij gives an upper bound on the number of values 
that are still available for Xij given the values of the random variables that 
involve vertices that ^-precede i. Likewise, Nij is an upper bound on the 
number of possible values for Xij when all -<-preceding choices are known. 

For a given ordering -< we derive: 

log{STS{n)) = H{X) = J2h{X,JX, : e -< < ^ E^[log(A^, J]. 

We take the expectation over the random choice of -< to obtain 
log(^T^(n)) < ^Ex[E^[log(Ar,J]]. 

Let us fix X and a pair i ^ j and turn to bound E_.[log(A^jj)]. With 
probability | there holds log{Nij) = 0, so that 

E^[log(Ar,,,)] =Pr(i^.,,)E^|F.,[log(A^„-)] = [log(A^„-)]. 

Clearly, A^jj depends only on the ordering <^. If p is the position of i in 
<^, then 

E^I^,,Jlog(M,,)] = E«|p^,Jlog(M,J] = 

EpiF., [E«|,,,.,, [log(M,,)]] < Ep\F.,, [log(E«|,,,.,^. [M,,])]. (3) 

The last inequality follows from Jensen's inequality. We next analyze the 
distribution of p and the expectation of Mjj given p. In the following lemmas 
we denote Xij by k. 
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Lemma 3.1. The probability that i occupies the p-th position in given 
Fij , is 

^ {n-p){n-p - 1) 
n(n — l)(n — 2) 

Proof. We are sampling <C uniformly from among the y permutations in 
which i precedes j and k. To specify such a permutation in which i is in the 
p-th position we place j in any of the n — p positions following z, and then 
place k in one of the n — p — 1 remaining positions following i. The remaining 
vertices can be ordered in (n — 3)! ways for a total of {n — p){n — p — l){n — ?>)] 
such permutations. The conclusion follows. □ 

Lemma 3.2. E«|,,^^,JM,,] = 1 + ^""^-^(i^!;)^;?^"-^-^^ 

Proof. Now we are sampling uniformly from the set of orderings in which 
i j,k where i is in the p-th position. Clearly k G Aiij. This corresponds 
to the 1 term. If t G V \ {i, j, k}, let a (resp. b) be the unique vertex such 
that Xi^a = t (resp. Xi^b = t ). The vertex t forms a triple with i and a, and 
a triple with j and b. If an edge from either of these triples is exposed before 
{i,j}, then t is ruled out for Xij. Note that t G Mij iff « -C a, b, t. 
But 

Pr(i ^ a, b, t\i is in position p,i j, k) = 

{n — p — 2){n — p — 3){n — p — 4) 
(n-3)(n-4)(n-5) ' 
There are n — 3 such vertices t, and the conclusion follows. 



□ 



Using lemmas 3.1 and 3.2, we have 

Ep|F,,[log(E«|,,^,_jM,,])] 



n-2 



^ {n - p){n - p - 1) (n-p-2)(n-p-3)(n-p-4) . 
?7,(?7, — l)(n-2) (n-4)(n-5) 

3 ^ , (r-2)(r-3)(r-4), 

2_^r(r - l)log(l + ^ 



n(n-l)(n-2)^ ' ' (n-4)(n-5) 



10 



As in the previous section, the next step is to collect together lower order 
terms and obtain 



J log '^^ + "^(l) — log^ — 1 + 0(1) 



Together with 3, this implies that 

E^|P,^log(M,,,)] = logn - 1 + 0(1). (4) 

We next show that for every 1 < / < n, 

E^I^,_^,M,,=;[log(iV.,,)] = log/ - 1 + 0(1). 

Let q be the position taken by in the uniformly chosen random 

ordering of the edges in and m = \Ei\. Again, by Jensen 

"^AF,,, ,M.,,=l [log(^^j)] < ,M,,j=l [log(E^ ,M.,,=/ [N^,j])]■ 

The following two lemmas describe the distribution of q and the expec- 
tation of Nij given q. We maintain the notation that k is the vertex Xij. 

Lemma 3.3. The probability that {i,j} occupies the q-th position in the 
ordering of Ei, given Fij, is 

2. ("^ - 1) 



raim — V 



Proof. There are ^ orderings of Ei in which {i, j} precedes {i, k}. There are 
m — q possible positions for {i, k} following {i,j}. The conclusion follows. □ 

Lemma 3.4. E,|,^^,^,,M,,=.[iV,,] = 1 + - !)• 

Proof. Now we are sampling uniformly from the set of orderings of E^ in 
which {i,j} precedes {i, k} and {i,j} is in the g-th position. For each vertex 
V, we determine the probability that v G J^ij, and then use the linearity of 
the expectation to obtain the result. We consider only vertices in Aiij. 

Clearly, k G A/ij. This corresponds to the 1 term. If t G Aiij \ {k}, 
then t G A/ij iff {i,j} -< {i, a}, {i, t}, where a is the unique vertex such that 

— But 

Pr({«,j} -< {i,a},{i,t}\{i,j} is in position q,{i,j} -< {i,k}) = 



11 



(m — g — l)(m — g — 2) 
(m — 2)(m — 3) 

There are / — 1 such vertices t, and the conclusion follows. 
Therefore, 

E^|F,,,A/,,=«[log(7V,,)] < 

y 4^ Ml + _ 1)) 

^m(m-l) (m-2)(m-3) ^ 

2 . (r-l)(r-2) 

> r log 1 + ^ - 1 . 



I'm 




I u log 


Jo 





□ 



m(m — 1) (m — 2)(m — 3) 

As above, this is equal to 
2 

Putting all of this together, we have 

n 

E^I^^,Jlog(iV„-)] = Pr (M,, = /)E^|^^,^,M,,=;[log(iV,,)] < 
1=1 ^' 

n 

Pr (M,, = O(log(0 - 1 + 0(1)) = E^|^^,Jlog(M,,)] - 1 + o(l) < 
1=1 ^' 

logn - 2 + o(l) 

Thus, 

log(^T^(n)) < i V (logn - 2 + 0(1)) = 

^|^(logn-2 + o(l)), 
which yields the bound STS{n) < ((1 + o(l))^) ^ as claimed. 
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